Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition

نویسندگان

  • Xuesong Yang
  • Kartik Audhkhasi
  • Andrew Rosenberg
  • Samuel Thomas
  • Bhuvana Ramabhadran
  • Mark Hasegawa-Johnson
چکیده

The performance of automatic speech recognition systems degrades with increasing mismatch between the training and testing scenarios. Differences in speaker accents are a significant source of such mismatch. The traditional approach to deal with multiple accents involves pooling data from several accents during training and building a single model in multi-task fashion, where tasks correspond to individual accents. In this paper, we explore an alternate model where we jointly learn an accent classifier and a multi-task acoustic model. Experiments on the American English Wall Street Journal and British English Cambridge corpora demonstrate that our joint model outperforms the strong multi-task acoustic model baseline. We obtain a 5.94% relative improvement in word error rate on British English, and 9.47% relative improvement on American English. This illustrates that jointly modeling with accent information improves acoustic model performance.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-accent and accent-independent non-native speech recognition

In this article we present a study of a multi-accent and accentindependent non-native speech recognition. We propose several approaches based on phonetic confusion and acoustic adaptation. The goal of this article is to investigate the feasibility of multi-accent non-native speech recognition without detecting the origin of the speaker. Tests on the HIWIRE corpus show that multi-accent pronunci...

متن کامل

Speech Recognition of South African English Accents

Declaration By submitting this thesis electronically, I declare that the entirety of the work contained therein is my own, original work, that I am the sole author thereof (save to the extent explicitly otherwise stated), that reproduction and publication thereof by Stellenbosch University will not infringe any third party rights and that I have not previously in its entirety or in part submitt...

متن کامل

FOREIGN ACCENT CLASSIFICATION USING SOURCE GENERATOR BASED PROSODIC FEATURES - Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Speaker accent is an important issue in the formulation of robust speaker independent recognition systems. Knowledge gained from a reliable accent classification approach could improve overall recognition performance. In this paper, a new algorithm is proposed for foreign accent classification of American English. A series of experimental studies are considered which focus on establishing how s...

متن کامل

Acoustic and Lexical Modeling Techniques for Accented Speech Recognition

Speech interfaces are becoming pervasive among the common public with the prevalence of smart phones and cloud-based computing. This pushes Automatic Speech Recognition (ASR) systems to handle wide range of environments including different channels, noise conditions and speakers with varying accents. This thesis focuses on the impact of speakers’ accents on the ASR models and techniques to make...

متن کامل

Acoustic modelling of English-accented and Afrikaans-accented South African English

In this paper we investigate whether it is possible to combine speech data from two South African accents of English in order to improve speech recognition in any one accent. Our investigation is based on Afrikaans-accented English and South African English speech data. We compare three acoustic modelling approaches: separate accent-specific models, accentindependent models obtained by straight...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1802.02656  شماره 

صفحات  -

تاریخ انتشار 2018